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REVIEW 



Recognition of lipopolysaccharide pattern by TLR4 
complexes 

Beom Seek Park^ and Jie-Oh Lee^ 

Lipopolysaccharide (LPS) is a major component of the outer membrane of Gram-negative bacteria. IVIinute amounts of LPS 
released from infecting pathogens can initiate potent innate immune responses that prime the immune system against further 
infection. However, when the LPS response is not properly controlled it can lead to fatal septic shock syndrome. The common 
structural pattern of LPS in diverse bacterial species is recognized by a cascade of LPS receptors and accessory proteins, LPS 
binding protein (LBP), CD14 and the Toll-like receptor4 (TLR4)-MD-2 complex. The structures of these proteins account for 
how our immune system differentiates LPS molecules from structurally similar host molecules. They also provide insights useful 
for discovery of anti-sepsis drugs. In this review, we summarize these structures and describe the structural basis of LPS 
recognition by LPS receptors and accessory proteins. 
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INTRODUCTION 

In the initial phase of infection, the innate immune system 
generates a rapid inflammatory response that blocks the 
growth and dissemination of the infectious agent. This 
response is followed, in vertebrates, by an adaptive immune 
response that leads to its elimination. The adaptive immune 
receptors are highly specific for the particular pathogen that 
induces them. Unlike the adaptive response, the innate 
immune response, mediated mainly by macrophages and 
dendritic cells, involves a limited number of germline-encoded 
pattern recognition receptors that recognize common struc- 
tural patterns among diverse molecules produced by microbes. 
Among these receptors, the representative key players are 
proteins of the Toll-like receptor (TLR) family. ^'^ 

Lipopolysaccharide (LPS) is a well-characterized pathogen- 
associated molecular pattern found in the outer leaflet of the 
outer membrane of most of the Gram-negative bacteria. It can 
initiate a strong immune response and serves as an early 
warning signal of bacterial infection. LPS is initially extracted 
fi'om bacterial membranes and vesicles released from them by 
LPS binding protein (LBP) in serum. LBP then transfers LPS to 
CD 14, which can be found either in soluble form or linked to 



the cell surface by a glycosylphosphatidylinositol anchor. CD 14 
splits LPS aggregates into monomeric molecules and presents 
them to the TLR4-MD-2 complex. Aggregation of the TLR4- 
MD-2 complex after binding LPS leads to activation of multiple 
signaling components, including NF-kB and IRF3, and the 
subsequent production of pro -inflammatory cytokines.^"^ 

The progress of genome sequencing projects has led to 
the identification of 13 TLRs in mammalian genomes, 10 
in humans and 12 in mice. The TLRs are type I integral 
membrane receptors composed of an extracellular domain, a 
single transmembrane helix and an intracellular signaling 
domain. The extracellular domain has a horseshoe-shaped 
solenoid structure and is responsible for the recognition of 
common structural patterns in various microbial molecules. 
For example, lipoproteins or lipopeptides are recognized by 
TLR2 complexed with TLRl or TLR6, viral double- stranded 
RNAs by TLR3, LPSs by the TLR4-MD-2 complex, bacterial 
flagellins by TLRS, viral and bacterial single-stranded RNAs by 
TLR7 or TLRS and CpG-rich under- methylated DNAs by 
TLR9. Binding of these ligands results in the formation of large 
receptor-adaptor complexes that lead to production and 
secretion of cytokines.^"^ 
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In this review, we discuss the reported structures of the 
TLR4-MD-2 complex, LBP and CD14, and consider their 
implications for our understanding of TLR4-MD-2 activation 
in response to recognition of LPS. 

STRUCTURE OF LPS 

LPS is a macromolecular glycolipid composed of a hydro- 
phobic lipid A region attached to a long and branched 
carbohydrate chain. The lipid A portion, which is responsible 
for most of the immunologic activity of LPS, is composed of a 
phosphorylated diglucosamine backbone with four to seven 
acyl chains attached to it. Four of the acyl groups are directly 
linked to the 2, 3, 2' and ?>' positions of the glucosamine 
backbone, and the remaining two are attached to the hydroxyl 
groups of the lipid chains. Lipid A from different bacterial 
species shows substantial structural diversity.^' The number 
and length of the acyl chains can vary, and the phosphate 
groups can be modified by other chemical groups. 

The carbohydrate region of LPS can be divided into two 
areas, the core and the O-specific chain. The core region is 
relatively conserved among bacterial species and contains 
unusual carbohydrate residues such as heptose and KDO that 
are not usually found in host cells. The O-specific region is 
composed of many copies of carbohydrate repeating units. 
Bacterial cells produce a highly heterogeneous set of repeating 
units with different structures. Removal of the entire carbo- 
hydrate chain by acid hydrolysis has only a minimal effect on 
the inflammatory activity of LPS, demonstrating that the core 
and O-specific carbohydrates have only a minor role in 
recognition by host immune receptors.^' 

STRUCTURES OF LBP AND CD14 

LPS molecules, due to their amphipathic nature, form large 
aggregates in aqueous environments above a critical micellar 
concentration. The accessory proteins, LBP and CD 14, 
enhance the detection of LPS by the TLR4-MD-2 complex 
by extracting and monomerizing LPS before its presentation to 
TLR4-MD-2. LBP is an acutely induced plasma protein that 
binds avidly to LPS aggregates and delivers them to CD 14.^^'^^ 
It belongs to the lipid transfer or LBP family. Other members 
of the family are bacterial and permeability- increasing protein 
(BPI), cholesterol ester transfer protein, phospholipid transfer 
protein and a few poorly characterized proteins. Of 
these proteins, the structures of BPI and cholesterol ester 
transfer protein have been determined; ^^'^^ the structure of 
LBP has not been reported but it is expected to share the 
general features of BPI because the two proteins have 48% 
sequence homology. BPI is a boomerang-shaped molecule 
composed of a central |3 sheet with barrel- shaped domains at 
its termini (Figure la).^^'^^ However, the two proteins differ 
functionally: LBP transfers LPS to TLR4-MD-2, whereas BPI 
does not. Structural studies are required to account for the 
functional difference between the two proteins. 

CD 14 is expressed on the surface of myelomonocytic cells in 
the form of a glycosylphosphatidylinositol-linked glycoprotein, 
and as a soluble protein in the serum.^^ Its crucial role in LPS 




CD14 



Figure 1 Tlie structures of accessory proteins involved in LPS 
recognition, (a) Tlie crystal structure of BPI, with two phospholipid 
binding sites. LBP is expected to have a similar structure, 
(b) CD14 forms homodimers. The monomeric subunit of CD14 
contains 11 LRR modules and a single LRRNT module. 

signaling has been confirmed using knock-out mice: CD14- 
deficient mice are highly resistant to septic shock initiated by 
injection of LPS or live bacteria.^^ CD 14 binds to LPS 
delivered by LBP and transfers the bound LPS to the TLR4- 
MD-2 complex. Since the presumed LPS binding pocket of 
CD 14 is too small for large LPS aggregates, it is likely to bind 
the monomeric form of LPS. 

CD 14 belongs to the leucine- rich repeat (LRR) family, 
and has the characteristic curved solenoid structure 
(Figure Ib).^^'^^ LRR family proteins are composed of 
multiple copies of LRR modules. The individual LRR 
modules consist of 20-30 amino-acid residues with highly 
conserved 'LxxLxLxxN' motifs. The central LxL part of the 
module forms the core of a (3 strand; the two leucines point 
toward the interior of the protein, forming the hydrophobic 
core, whereas the variable x residues within the motif are 
exposed to solvent and some are involved in interactions with 
ligands. Asparagines in the motif make stable hydrogen bonds 
with the backbone carbonyls of neighboring (3 strands 
throughout the entire protein, forming an extended hydrogen 
bonding network called an 'asparagine ladder'. As a result, the 
P strands are more closely packed, and assemble into a large (3 
sheet making up the entire concave surface of the horseshoe. 
Variable amino acids outside the conserved (3 strands of each 
LRR module are surface exposed and some of them have 
important roles in ligand interactions. To prevent exposure of 
the hydrophobic core of the LRR modules, there are two 
special modules named LRRNT and LRRCT at the N- and 
C-termini of the proteins. These modules do not show the 
sequence conservation pattern of the LRR modules and often 
contain an anti-parallel (3 hairpin stabilized by disulfide 
bridges.^^ 
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CD 14 exists as a homodimer.^^ Unlike other LRR family 
members, it does not contain an LRRCT module protecting its 
hydrophobic core. Instead, the C-terminus of the LRR modules 
of one CD 14 molecule interacts with the C-terminus 
of another, forming a dimer. The LPS interaction pocket of 
CD 14 is located at the boundary of the LRRNT and the first 
LRR module (Figure lb). In addition to LPS, CD 14 can bind 
other microbial products, such as peptidoglycan, lipoteichoic 
acid, lipoarabinomannan and lipoproteins. Therefore, it has 
broad ligand specificity and functions as a pattern recognition 
receptor by recognizing structural motifs in diverse microbial 
products.^^'^^ 

STRUCTURE OF THE TLR4-MD-2 COMPLEX 

The extracellular domain of TLR4 also belongs to the LRR 
family (Figure 2) and is responsible for ligand binding and 
receptor dimerization.^^'^^ The structure of TLR4, as well as 
those of TLRl, TLR2 and TLR6, deviates substantially from 
the consensus LRR structures.^'^'^^ LRR family proteins can be 
classified into seven subfamilies. Each subfamily is defined by 
its conformation, which is a function of the twist, tilt, rotation 
angles and radii of the central P sheet of the LRR motifs. Most 
of the LRR proteins have uniform radii and (3 sheet angles 
throughout the protein: for example, typical subfamilies have 
24 amino acids per module and 3io helices and loops in the 
convex area. However, TLRs 1, 2, 4 and 6 are divided by a 
structural transition into three subdomains: N- terminal, 
central and C-terminal. The structural discontinuities seem 
to be caused by irregular LRR sequences in the central domain. 
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Figure 2 Overview of LPS recognition by TLR4-l\/ID-2. LPS binding 
induces dimerization of tlie TLR4-l\/ID-2 complex, wliicli is 
proposed to enable dimerization of the intracellular TIR domains 
and recruitment of adaptor molecules such as MyD88. Aggregation 
of the death domains (DD) of MyD88 brings four IRAK4 and four 
IRAK2 molecules together forming a large tower-like structure 
called the 'Myddosome'. 



Interestingly, the domain boundaries in TLRs 1, 2 and 6 fulfill 
important roles in ligand binding.^^"^^ In the case of TLR4, its 
N-terminal and central domains clearly provide charge 
complementarity for binding of its surface to its co-receptor 
MD-2, forming a stable 1:1 heterodimer. 

MD-2 is smaller than TLR4 and is the main LPS binding 
module of the TLR4-MD-2 receptor complex. MD-2 has a [3- 
cup fold structure, composed of two antiparallel P sheets.^^'^^ 
The two sheets are separated from each other on one side of 
the sandwich-like structure so that the hydrophobic interior is 
exposed for interaction with ligands. This large internal pocket 
is ideally shaped for binding flat hydrophobic ligands like LPS. 
The (3-cup fold of MD-2 is shared by a small number of 
proteins mostly involved in lipid storage or transport.^^ This 
fold is very similar to the more famous immunoglobulin fold. 
However, it does not have the conserved disulfide bridge 
connecting the two (3 sheets of the immunoglobulin fold. 
Separation of the two (3 sheets and exposure of the internal 
pocket in the P-cup fold is possible because the connecting 
disulfide bridge is missing.^^'^^ 

MD-2 binds to TLR4 primarily via hydrophilic interactions 
such as hydrogen bonds and charge interactions, and a few 
hydrophobic residues are found in the binding interface.^^'^^ 
The long and narrow binding surface of TLR4 can be divided 
into two areas, the A and B patches. The negatively charged A 
patch interacts with the positively charged surface of MD-2, 
and the positively charged B patch with the negatively charged 
surface of MD-2. Mutation of residues involved in this 
interaction blocks TLR4 and MD-2 binding, and LPS 
signaling. 

LPS BINDING TO THE TLR4-MD-2 COMPLEX 

The crystal structure of the TLR4-MD-2 complex bound to a 
truncated form of £. coll LPS has been determined.^^ LPS 
binding induces the formation of the 'm' shaped 2:2:2 TLR4, 
MD-2 and LPS complex, with the two C-termini of the 
extracellular domains of TLR4 converging in the center and 
the N-termini stretching outwards (Figure 2). The acyl chains 
of the lipid A are inserted deep into the MD-2 pocket and the 
two phosphate groups of lipid A form charge and hydrogen 
bond interactions with charged residues in MD-2 and TLR4. 
Before it binds to LPS, MD-2 forms a stable heterodimer with 
the extracellular domain of TLR4. LPS binding causes dimer- 
ization of the TLR4-MD-2 complex because the LPS creates an 
additional binding interface between TLR4 and MD-2. 
The former interaction interface is named the primary inter- 
face, which is formed between one edge of MD-2 and part of 
its concave surface by the extensive network of charge- 
enhanced hydrogen bonds of TLR4, while the latter, dimeriza- 
tion interface of MD-2 is located opposite the primary 
interface and interacts with a convex surface provided by a 
small hydrophobic patch in the C-terminal domain of the 
other TLR4 in the TLR4-MD-2-LPS complex (Figure 3a). 

The structure-activity relationships of LPS have been 
studied for decades using natural and chemically modified 
LPS.^^ The crystal structure of TLR4-MD-2-LPS provides an 
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a Primary interface |_p5 Dimerizatiori interface 
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Figure 3 Binding of LPS and antagonistic ligands to tlie TLR4- 
l\/ID-2 complex, (a) Structure of tlie primary and dimerization 
interfaces of tlie TLR4-l\/ID-2-LPS complex. The lipid chains of 
LPS are labeled. MD-2 is colored grey. The lipid chains and 
phosphate groups of LPS are shown in red. The glucosamine 
backbone is pink, (b) Structures of Eritoran and lipid IVa bound to 
MD-2. 

explanation for why LPS with six Hpid chains is optimal for 
activation of TLR4 signaling. In the crystal structure, five of 
the six lipid chains of E coll LPS are completely buried inside 
the pocket, but the remaining chain is partially exposed to the 
MD-2 surface and forms the hydrophobic interaction interface 
together with hydrophobic surface residues of MD-2. 

Two phosphate groups attached to the glucosamine of LPS 
further support formation of the stable TLR4-MD-2 complex 
by making charge and hydrogen bond interactions simulta- 
neously with the two TLRs in the complex. Removal of one of 
the phosphate groups greatly reduces the inflammatory 
activity of LPS. This modified form of LPS is named 
monophosphorylated lipid A; it has recently been approved 
as a vaccine adjuvant because it retains strong immune 
stimulatory activity but has lost most of the inflammatory 
toxicity of LPS.^^~^^ Since the core carbohydrate part of £. coll 
LPS makes only a limited number of weak ionic and hydrogen 
bonds with TLR4 and MD-2, the carbohydrate chain should 
have only a minor role in the immunological activity of LPS.^^ 



Two single-nucleotide polymorphisms of human TLR4, 
D299G and T399I have been identified and are thought to 
be associated with LPS hyporesponsiveness. Recently, Ohto 
et al?^ determined the structure of polymorphic TLR4 in 
complex with MD-2 and LPS. They found that the alterations 
in their structure are quite localized, and the relative 
orientation and distance between the TLR4 subunits is not 
changed. They therefore proposed that the conformational 
changes induced by the two polymorphisms do not directly 
influence conformation of the TLR4-MD-2 complex, but 
rather reduce the response to LPS by affecting binding affinity, 
the folding efficiency, cell surface expression or stability of 
TLR4. 

BINDING OF ANTAGONIST TO THE TLR4-MD-2 
COMPLEX 

Some lipid A derivatives with four lipid chains have antag- 
onistic activity for the TLR4-MD-2 complex.^^'^^ The 
structures of two of these antagonists, Eritoran (or E5564) 
and lipid IVa, in complexes with TLR4-MD-2 and MD-2, 
respectively, have been determined. ^^'^^ Eritoran is a synthetic 
molecule derived from the lipid A component of the non- 
pathogenic LPS of Rhodobacter sphaeroides. It is a high affinity 
antagonist and has useful therapeutic activity against severe 
sepsis.^^'^^ Lipid IVa, a precursor form of LPS, acts as an 
antagonist of human TLR4-MD-2. When bound to MD-2, 
Eritoran and lipid IVa adopt remarkably similar structures 
(Figure 3b). Their glucosamine backbones are nearly super- 
imposable, despite structural differences between their lipid 
chains.^^ The lipid chains of both are packed inside the 
hydrophobic pocket in MD-2. Neither Eritoran nor lipid IVa 
induces dimerization of human TLR4-MD-2 because all four 
lipid chains are completely submerged inside the pocket and 
cannot provide a hydrophobic dimerization surface that can be 
used for interaction with TLR4. 

Although lipid IVa is an antagonist to human TLR4-MD-2, 
it has weak agonistic activity for the mouse TLR4-MD-2 
complex.^ ^ To explain this peculiar species- specific response, 
the structure of the lipid IVa bound to mouse TLR4-MD-2 
complex has been determined.^^ In this structure, the positions 
of the glucosamine backbone and the phosphate groups are 
nearly identical with those of LPS bound to TLR4-MD-2, and 
therefore it can induce dimerization of TLR4-MD-2. However, 
because lipid IVa has only four lipid chains, the hydrophobic 
pocket of MD-2 has a large unoccupied space in this structure. 
This arrangement of acyl chains is energetically unfavorable, 
which can explain why lipid IVa has only weak agonistic 
activity. 

ACTIVATION OF THE TLR4-MD-2 COMPLEX 

The TLR components of aU TLR-ligand complexes have 
similar dimeric conformations, with the two C-termini of 
the extracellular domains of the TLRs converging in the 
middle.^^'^^"^^ This structural arrangement supports the 
hypothesis that dimerization of the extracellular domains 
leads to juxtaposition of the intracellular domains, signaling 
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adaptors being then recruited to these dimerized intracellular 
domains. The intracellular domains of TLRs have substantial 
sequence and structural homology to the interleukin- 1 
receptor (IL-IR) family, and are named Toll/IL-IR homology 
(TIR) domains. TIR domains are not only found in the 
receptors but also in the signaling adaptors involved in TLR 
and IL-IR signaling.^ After binding LPS, the TIR domain of 
TLR4 interacts with the TIR domain of myeloid differentiation 
factor 88 (MyD88), in conjunction with another TIR- 
containing adaptor protein, referred to as MyD88 adaptor- 
like (Mai). Mutations of the TIR domains abrogate this 
interaction, demonstrating that the latter are responsible for 
this interaction.^-^"^^ 

MyD88 has been reported to be involved in the signaling 
pathways initiated by all TLRs with the exception of TLR3.^^ 
Of the MyD88- dependent pathways, only those involving 
TLR2 and TLR4 require Mai for efficient signaling. The 
MyD88 aggregation signal is transmitted to IL-1 receptor 
kinase (IRAK) through an interaction between the death 
domain of MyD88 and IRAK. Phosphorylation of the 
signaling kinases eventually activates the transcription 
factors, NF-kB and activator protein 1 via a signaling 
cascade. Instead of the MyD88-dependent pathway, the TRIP 
(TIR- domain containing adaptor inducing interferon- (3) 
pathway of TLR4 activates interferon response factors to 
produce and secrete type- 1 interferons. TIR domains are 
found not only in mammalian proteins but also in plant 
and bacterial proteins. Sequence conservation in the TIR 
family is relatively low, in the 20-30% range, and domain 
lengths vary between 135 and 160 residues because of large 
deletions and insertions in the loop regions.^ ^ 

Crystal structures of TIR domains have been reported for 
TLRs 1, 2 and 10, IL-IRAPL, MAL and MyD88, and for TIR 
domains of several plant and bacterial proteins (Figure 4a). 
In addition, an NMR structure has been reported for the 
MyD88 TIR domain.^^ In all these structures, alternating P 
strands and a helices are arranged in the form of five central 
parallel (3 strands surrounded by five a helices. Homo- 
multimeric TIR interfaces have been observed in crystals of 
TLR2 and TLR 10, but the functional relevance of these 
interactions remains uncertain because dimerization of these 
TIR domains has not been observed in solution. 

On the basis of the crystal structures of these monomeric 
TIR domains, mutational studies and computational modeling 
have been carried out to predict the homo- and heteromeric 
structures of TIR complexes.^ ^'^^'^^ The results suggest that the 
BB and the DD loops of the TLR TIR domains have key roles 
in TIR domain aggregation and signal activation. The BB loop 
that joins strands (3B and helix aB extends away from the rest 
of the TIR domain, forming a protrusion from the surface 
(Figure 4b). The DD loop connects the PD strand and the aD. 
The structures of the BB and DD loop regions vary consider- 
ably in different TIR structures. 

MyD88 contains not only a TIR domain but also a death 
domain. Its death domain binds to downstream IRAKs after 
the activation of TLRs. Recently, Lin et al.^^ determined the 



structure of the death domain complex of MyD88, IRAK4 and 
IRAK2. In this tower- like structure, the death domains form a 
large signaling complex, called a Myddosome, which consists 
of a four-layered left-handed helical arrangement of six 
MyD88, four IRAK4 and four IRAK2 death domains. On 
the basis of this structure, they proposed that formation of the 
Myddosome brings the kinase domains of the IRAKs into 
proximity so that they can be phosphorylated and activated. 

STRUCTURES OF OTHER TLRS 

To date, the structures of six of the ten human TLRs in 
complex with their physiological or synthetic ligands have 
been reported (Figure 5). TLR2 is unique among human TLRs 
because it can form heterodimers with other TLRs, TLRl and 
TLR6. The principal ligands of the TLR1-TLR2 complex are 
triacyl lipopeptides, the interaction with diacyl lipopeptides 
being substantially weaker.^ ^'^^ By contrast, the TLR2-TLR6 
complex is able to bind to diacyl lipopeptides. These lipo- 
proteins and lipopeptides are functionally and structurally 
diverse bacterial proteins anchored to the membrane by two or 
three covalently attached lipid chains.^^ 

The structures of TLR1-TLR2 and TLR2-TLR6 with the 
synthetic lipopeptide ligands, Pam3CSK4 or Pam2CSK4, have 
been determined.^^'^^ The overall shapes of the complexes are 
similar to that of TLR4 dimers: the two N-terminal regions 
stretch outward and the C-termini of the extracellular domains 
converge in the middle. The Pam3CSK4 binding pockets of 
TLRl and TLR2 are formed at the boundaries between the 
central and C-terminal domains. The three lipid chains of 
Pam3CSK4 mediate TLR2-TLR1 heterodimer formation: two 
are inserted into the TLR2 pocket, and the remaining, amide- 
bound, lipid chain is inserted into the narrower TLRl channel. 
Heterodimers of TLRl and TLR2 are further stabilized by 
protein-protein interactions near the opening of the ligand 
binding pocket. 

Whereas the TLR1-TLR2 complex recognizes triacylated 
lipopeptides, the TLR2-TLR6 complex recognizes the diacy- 
lated ligand, Pam2CSK4, which lacks the lipid chain that binds 
in the hydrophobic pocket of TLRl. Nevertheless, the structure 
of the TLR2-TLR6-Pam2CSK4 complex indicates that TLR6 
and TLRl form m-shaped structures very similar to the TLRl- 
TLR2-Pam3CSK4 complex. However, there are important 
structural differences between TLRl and TLR6 in their ligand 
binding and dimerization regions. In TLR6, the side chains of 
two phenylalanine residues block the lipid binding channel, 
creating a channel that is less than half the length of that of 
TLRl.^^ This structural feature explains the selectivity of the 
TLR2-TLR6 complex for diacylated over triacylated lipopep- 
tides. Despite the absence of the amide-bound lipid chain in 
the diacylated lipopetide, TLR2 can still form a stable 
heterodimer with TLR6 because the lack of the lipid-protein 
interaction is compensated for by expanding the hydrophobic 
protein-protein interface. 

TLR3 has been shown to recognize double-strand UNA 
(dsRNA), which is produced by various viruses at some point 
in their life cycle and is a potent danger signal of virus infection.^^ 
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Figure 4 TIR domain structures, (a) Tlie figure was drawn using coordinate files witli PDB codes as follows: IFYV (TLRl), IFYW (TLR2), 
2J67 (TLRIO), 2Z5V (MyD88) and 2Y92 (Mai). The backbones of the TIR domains are yellow; positive charges are blue and negative 
charges red. (b) Sequence alignment of TIR domains. The BB and DD loops important in TLR signaling are boxed in red. 
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Figure 5 Structures and a phylogenetic tree of TLRs. Crystal 
structures of TLR4-MD-2-LPS, TLR2-TLRl-Pam3CSK4, TLR2- 
TLR6-Pam2CSK4, TLR5-flagellin, TLR3-dsRNA, TLR8-CL097 are 
shown. The ligands are colored red, and TLRs are blue and green. 



TLR3 has the largely uniform and flat LRR structure typical of 
the subfamily, but lacks the subdomain structure seen in TLRl, 
TLR2, TLR4 and TLR6.^^'^5 It is heavily glycosylated, with 15 
predicted N-glycosylation sites, of which 1 1 are visible in the 
crystal structure. The dsRNA interacts with both N-terminal 
and C-terminal sites on the lateral side of the convex surface 
of TLR3.^^ The positively charged residues near the N- 
and C -termini of the extracellular domain of TLR3 make 
the major contribution to the interaction with the sugar- 
phosphate backbones of the dsRNA. Mutational analyses 
have established that simultaneous binding to the N- and 
C-terminal sites is required for stable binding of the 
dsRNA. TLR3 interacts with the ribose-phosphate backbone 
of dsRNA, thus accounting for the lack of RNA sequence 
specificity. The TLR3 structure can also explain the strong 
pH dependence of RNA binding: several histidines make 
crucial bonds with the phosphate backbones of the RNA, 
and their protonation appears to be essential to stabilize the 
interaction. 

TLRS is one of the few TLRs that recognize a protein 
pathogen-associated molecular pattern, namely bacterial 
flagellin, which is the monomeric subunit of the flagellar 
fiber.^^ The crystal structure of zebrafish TLRS has been 
determined in a complex with a truncated fragment of 
Salmonella FliC.^^ The overall shape and curvature of the 
TLRS LRR is better categorized as TLR3-like than TLR4- or 
TLR2-like, which is consistent with the evolutionary proximity 
of TLR3 and TLRS (Figure S). The structure of the TLRS-FliC 
complex shows that the flagellin Dl domain has the dominant 
role in the binding and dimerization of TLRS. The Dl domain 
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consists of four elongated segments that assemble vertically 
into a long, largely helical, rod. The four segments consist of 
two N-terminal oc helices, one C-terminal a helix and a 
segment of P hairpin. TLR5 interacts primarily with the three 
helices of the Dl domain via its side, which is formed by the 
segment between LRRNT and LRRIO. At the same time, the 
Dl domain interacts with the second TLR5 molecule in the 
dimer and therefore bridges the two TLRs. The two TLR5 
molecules in the dimer also closely resemble the m-like 
structures of the other TLR dimers. 

TLR7 and TLRS recognize the single- strand RNA of various 
viruses, as well as small- interfering RNAs. They also mediate 
the recognition of self RNA released from dead or dying cells 
and are responsible for certain autoimmune diseases. In 
phylogenetic analyses, TLR7, 8 and 9 form a subfamily with an 
amino-acid sequence longer than other TLRs (Figure 5). 
In 2013, the crystal structures of unliganded and ligand- 
induced activated human TLRS dimers were reported.^^ Small 
molecule agonists, R848, CL097 and CL075, were used in that 
crystallographic work. The overall TLRS monomer structure 
resembles a ring and contains 26 LRR, which is the largest 
number of LRR modules among the TLRs. It contains a long 
region of ^40 amino acids between LRR14 and LRR15, which 
is referred to as the Z-loop. This can be cleaved by unidentified 
proteases, splitting TLRS into half and generating fragments 
of 60 and SOkDa within the cell. This proteolytic step has 
been proposed to be essential for the maturation and sorting 
of TLR9.68-70 

Unlike other TLRs, TLRS forms homodimers even before 
binding agonistic ligands. In the absence of bound ligand, the 
C-terminal ends of the two TLRS subunits in the dimer are 
separated by ^ 53 A; upon ligand binding, the two C-termini 
are brought into closer proximity, and so should be able to 
promote close approximation of the intracellular TIR domains. 
This structural change is induced by several interactions 
between TLRS and the bound ligand. The most important 
residues in this interaction are Phe405 and Asp543, which 
form a n stack interaction with benzene rings of the 
imidazoquinoline group of RS4S or the thiazoloquinolone 
groups of CL097 and CL075, and form hydrogen bonds with 
the nitrogen atoms of the imidazole and thiazole moieties. 
Thus, acting as a kind of molecular glue, the bound ligands 
activate TLRS by rearranging the preformed receptor dimer. 

THE HYBRID LRR TECHNIQUE 

Structure determination of TLR family proteins has been a 
major challenge in the study of innate immunity because 
crystallization of these proteins is not trivial. This obstacle was 
overcome by a novel crystallization technique named the 
'hybrid LRR technique' (Figure 6)?^'^^ In this method, a 
panel of hybrids of target and partner LRR family proteins is 
generated. The hybridization partner proteins are chosen from 
a set of LRR family proteins that are easy to produce and 
crystallize. The LRR modules of the pairs of proteins are 
joined while conserving the sequence of the modules. Due 
to the highly modular nature of LRR family proteins this 



a 




b 



Hybrid 



TLR4 

Partner 2 ^ 




Figure 6 The hybrid LRR technique. Two LRR family proteins 
are fused together while maintaining the sequence conservation 
pattern of the LRR modules, (a) The functional domain of 
TLR4 fused with a partner LRR protein at its C-terminus. 
(b) Alternatively, the partner protein can be fused to the N-terminal 
region of TLR4. 

simple-minded fusion of the two proteins was surprisingly 
successful, with a success rate exceeding 50%. In a typical 
fusion protein procedure in molecular biology, the two 
proteins are connected by a flexible linker. As a result, the 
hydrophobic cores of the two proteins remain separate and 
their tendency to crystallize is reduced due to the flexibility of 
the linker. However, the crystallization behavior of hybrid LRR 
proteins is often superior because the hydrophobic cores of the 
two proteins are connected directly and the resulting hybrid 
has a rigid structure. Because one can generate a practically 
unlimited number of hybrid proteins for a single target protein 
by switching partner proteins, some of them should be able to 
crystallize even though the original target is not. This 
technique was successfully applied to the crystallographic 
study of TLRs 1, 2 and 4-6 described in this study, and to 
RP105, a close relative of the TLRs.^^'^^'^^'^^'^^ 

CONCLUSIONS AND PERSPECTIVES 

LPS is a potent stimulator of the innate immune response. Its 
structure and immunological activity has been studied for over 
a century. In the last decade, crystal structures of most of the 
receptors and accessory proteins involved in LPS recognition 
have been determined. These structures explain how our 
immune system recognizes the common structural pattern in 
diverse LPS molecules. They also provide an initial clue as to 




Experimental & Molecular Medicine 



LPS recognition by TLR4 

BS Park and J-0 Lee 



how LPS binding initiates the aggregation of intracellular 
signaling proteins that leads to cytokine production and the 
initiation of inflammatory signaling. TLR4 is the key receptor 
involved in LPS recognition and signal initiation. Up to now, 
the structures of six of the ten human TLRs in complex with 
their cognate ligands have been determined. Although the 
locations and structures of the ligand interaction sites of these 
TLRs differ greatly, they all form m-shaped dimers with the 
two C-termini in the center, and the N-termini on the outside. 
All this structural information should contribute to the 
development of novel therapeutic agents that either attenuate 
or enhance TLR-mediated signaling. 
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